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Abstract 

This paper presents a study made to automatically 
recognize lesion-like regions in black skin medical 
images. Skin lesions have consistently had one of the 
most rapidly increasing incidences of all cancers. Early 
diagnosis is particularly important but it is a challenging 
task, especially for black populations. Moreover, black 
skin specialists are often limited to the use of classic 
macroscopic images for diagnosis purpose. All these 
difficulties are related to black skin pigmentation level 
and the small visual differences between lesion parts and 
healthy ones. 

We propose here a computerized method which identifies 
automatically lesion regions with more accuracy and 
efficiency. It works like an automaton that traverse lesion 
images and automatically classifies healthy regions from 
lesions regions. The designed classifier is a Multi-Layer 
Perceptron Artificial Neural Network (MLP-ANN) 
trained with color and texture features. We made many 
combinations of features and varied the number of 
neurons in hidden layer, in order to obtain best 
performances. Nine features (six of texture and three of 
color) have been retained to train the network. The 
achieved classification performance is 97.2% in both 
training, validation and testing set. 1 

Keywords: Medical images, Black skin, Neural Network, 
Lesion recognition, Texture features. 

Nomenclature 

ANN 
CAD 
MLP 

MLP-ANN 

CADS 
PPV 
NPV 
TPR 
SPC 


Artificial Neural Network 
Computer Aided Diagnosis 
Multi-Layer Perceptron 
Multi-Layer Perceptron type Artificial 
Neural Network 

Computer Aided Diagnosis System 
Positive Predictive Value or Precision 
Negative Predictive Value 
True Positive rate or Sensitivity 
Specificity or True Negative Rate 


1 This study has been implemented on Matlab Neural 
Network Toolbox software at LETIA lab. University of 
Abomey-Calavi in Republic of Benin 


LPR 

false Positive Rate 

ACC 

Accuracy 

LP 

false Positive 

LN 

false negative 

TP 

True Positive 

TN 

True Negative 


1. Introduction 

Dermatology is one of the fields where medical images 
have been used for years, since their potential benefits 
had been revealed in 1992 by Stoecker and Moss [1]. 
However, many parameters such as: type and quality of 
images, noise, or acquisition techniques influence image 
processing. About that, black skin images are certainly 
one for which difficulties occur most. In fact, while 
several studies proved the effectiveness of dermoscopic 
images based diagnosis [2-5], dermoscopy is ineffective 
in black skin and then unusable by black skin specialists. 
Therefore, these specialists are limited to the use of 
standard digital cameras as acquisition devices. Once 
images are acquired, major difficulties appear regarding 
their processing. 

Indeed, in black skin, there is the presence of pigment 
throughout the basal layer of epidermis [6]. Therefore, 
the approach with black population is quite different, 
even if diagnostic procedure is generally the same, 
regardless of skin color [7]. So, it is important to make 
research in the direction of setting up Computer Aided 
Diagnosis (CAD) systems specific to black skin. In this 
context, we proposed in a previous paper, a method to 
analyze black skin macroscopic medical images. In that 
work, we focused on preprocessing and segmentation 
using a combination of mathematical morphology and 
edge detection [8]. The method improved edge detection 
but had also shown its limits. So, further studies deserved 
to be undertaken in order to develop a more efficient 
system. 

With the significant advances occurred during recent 
years in the field of computer science and artificial 
intelligence, medical images processing has become 
more reliable and accurate [9]. Some good applications 
can be found in [10], [11] [12] and [13]. Artificial Neural 
Networks (ANNs) in particular, have been use to solve 
many image processing problems, as reported in [14], [15] 


1 


Graphics, Vision and Image Processing Journal, ISSN 1687-398X, Volume 16, Issue 3, ICGST LLC, Delaware, USA, Dec. 2016 


and [16]. Among them, we can enumerate: applications 
of ANNs in pre-processing step [17-19]; ANNs trained to 
perform segmentation task [20-22]. ANNs have also been 
used in medical diagnosis as main stage of a CADS [23]. 
Despite these advances, there exist very few studies on 
the subject of black skin, as regards the development of 
CADS. The typical pipeline adopted for automated skin 
lesion diagnosis is: image acquisition, artifact detection, 
lesion segmentation, feature extraction and classification 
[24]. In this paper, we propose a neural network based 
classifier acting in the artifact detection stage. It has been 
trained with color and texture features and works like an 
automaton that traverse black skin lesion images and 
automatically differentiates healthy regions from lesions 
regions. 

The remainder of the paper is organized as follows: 
Section (2) discusses the theory about feedforward type 
neural networks; Section (3) emphasizes on the material 
and the methods used in this work; In Section (4) we 
present the results and make some discussions and finally, 
we conclude in Section (5). 

2. Theory 

An Artificial Neural Network is a computational system 
inspired by the structure, the processing method and the 
learning ability of a biological brain. It is characterized 
by a large number of very simple neuron-like processing 
elements, a large number of weighted connections (links) 
between these elements and a distributed representation 
of knowledge over the connections [25]. Knowledge is 
acquired by the network through a learning process [26]. 
ANNs’ expansion started in 1943 when McCulloch and 
Pitts proved that neuron can have two states and that 
those states could be dependent on some threshold value 
[27]. Since then, ANNs have been widely used in 
research because they can model highly non-linear 
systems in which the relationship among the variables is 
unknown or very complex [28]. A review of various 
classes of neural networks can be found in [29] and [30]. 
Multilayer perceptrons (MLPs) are the most popular type 
of neural networks in use today. They belong to a general 
class of structures called feedforward neural networks, a 
basic type of neural network capable of approximating 
generic classes of functions, including continuous and 
integrable functions [31]. 

MLP Structure: In MLP structure, neurons are grouped 
into layers. The first and last layers are called input and 
output layers respectively, because they represent inputs 
and outputs of the overall network. The remaining layers 
are called hidden layers. Typically, an MLP neural 
network consists of an input layer, one or more hidden 
layers, and an output layer, as shown in Figure 1. 
Mathematical background: Suppose the total number of 
layers is L. The 1 st layer is the input layer, the I th layer is 
the output layer, and layers 2 to L— 1 are hidden layers. 
Let the number of neurons in I th layer 
be N e , t - 1,2,...,L and w*. represent the weight of the 

link between j th neuron off- X th layer and i th neuron of 
I th layer, !</<N i _ l , and l<i<N e . Let x* represent the i th 
external input to the MLP, and z. be the output of i th 


neuron of I th layer. An extra weight parameter w* 0 is 
introduced for each neuron, representing the bias for i th 
neuron of i th layer. As such, w of MLP includes w\. , 

j=0,l,..., N t _ l9 i=l,2,..., N t , l=2,3,...,L. w expression is 
given by equation (1): 

W = [wf 0 W 2 U W 2 2 ... W L NlNl i J (1) 


Layer L 
(output layer) 


Layer L - 1 
(hidden layer) 




Figure 1. Multilayer perceptron (MLP) Neural Network structure 

In a neural network, each neuron with the exception of 
neurons at the input layer receives and processes stimuli 
(inputs) from other neurons. The processed information is 
available at the output end of the neuron. A neuron of 

thef^ layer receives stimuli from neurons of l -X th layer, 
that is: h [ x , h l ' 2 ,..., . Each input is multiplied by the 

corresponding weight parameter, and resulting products 
are added to produce a weighted sum y i (cf. equation 3). 

This weighted sum is passed through a neuron activation 
function <J to produce the final output h. of the neuron 

(cf. equation 2). This output represents the hypothesis (or 
assumption) of the neuron, and it can become the 
stimulus for neurons in the next layer: 

h!=c 7ir!) ( 2 ); = ( 3 ) 

J = 0 

Learning process: The purpose of ANN is to learn to 
recognize patterns in data. Once the neural network has 
been trained on samples data, it can make predictions by 
detecting similar patterns in future data. The process by 
which MLP neural network achieves learning or training 
is a two steps: feedforward and backpropagation. 

In the feedforward process, the external inputs are first 
fed to the input neurons 1 st layer, the outputs from the 
input neurons are fed to the hidden neurons of the 2 nd 
layer, and so on, and finally the outputs of L-\ th layer are 
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fed to the output neurons L th layer. The difference 
between the resulting outputs and the expected ones is 
calculated to get the error. In backpropagation, errors that 
result from previous step are propagated back through the 
system, causing the system to adjust the connection 
weights. The training begins with random weights and 
the goal is to adjust them until the error is minimal. This 
is achieved by minimizing the objective function E of 
equation (4): 

i m 

E = ~Y J (K-y i ) 2 ( 4 ) 

^ i = 1 

More details about ANN learning process is available in 
[26]. 

3. Material and methods 

Material: We implemented our programs and algorithms 
with MATLAB, Version 8.4 (R2014b). We designed, 
implemented and trained the network with the MATLAB 
neural network toolbox. Our data set includes 40 medical 
images which were taken in same conditions (cf. Table I). 


Table I. Images Database Composition 


Types of Images 

Number 

Erythema lesion 

17 

Tuberous Sclerosis Complex (TSC) lesion 

6 

Purpura lesion 

5 

Seborrheic Keratosis lesion 

7 

Eczema lesion 

5 

Images showing black people skin 

20 


These images were taken with a HD digital camera in the 
Dermatology service of the Hubert K. Maga University 
Hospital of Cotonou. Figure 2 shows samples from each 
type of skin lesion. 


(a) 

(b) 

(c) 

(d) 

(e) 

(0 

(g) 

(h) 

(i) 

5 

Mm 


Figure 2. Some black skin lesions of our image database ( a: Purpura 
lesion; c - j : Eczema lesion; e: TSC lesion; g: Seborrheic Keratosis; 
b - d - f - h - i : different types of Erythema) 


We also downloaded images showing black people skin, 
from different online databases. From all these images, 
we extracted blocks of pixels of size 3x3, thanks to a 
program. Each block represents an example of the dataset 
(see figure 3 for extraction procedure). At the end, we 
obtained about 800.000 learning examples that we 


divided into learning, validation and test sets, in the 
respective proportions of 60%, 20% and 20%. 



i 


Figure 3. Regions extraction to blocks of pixels constitution 

Methods: Normal skin regions are homogenous 

connected groups of pixels. So, color and texture 
information can be used to describe them. Fifteen 
features were identified to train the NN, but nine were 
finally retained. So the designed network has nine 
neurons in input layer and one in output layer. We varied 
the number of neurons in the hidden layer until we get 
the best performances. The input layer neurons 
correspond to features calculated for each example (3x3 
pixel block). There are three color descriptors (one for 
each RGB color channel) and six texture descriptors. 

Color features: We calculate the mean intensity value for 
each of the R, G and B channels (Mr, Mg and Mb) with 
formula (5). 

• for each block X (3,3), 

(5). k = {R,G,B} 

* i = 1 

Texture features: The retained texture features are: 
uniformity, standard deviation, skewness, kurtosis, 
smoothness, and entropy. They were chosen for their 

simplicity and efficiency. Let z. represent the value of a 
pixel x(i) ; features are defined as follows: 

• Uniformity: it is maximum when all gray levels 
are equal (maximally uniform). 

U=f j p 2 (z i ) (6) 

/ = 1 

• Standard deviation: measure of average contrast. 

a = Y J ( z i~ m j 2 p{.Zi) ( 7 ) 

i = 1 

• Skewness: it is the measure of the asymmetry of 
the intensity values around the mean intensity. 

' s ' = ZL _m )TU (8) 

i=i 

• Smoothness: its value is 0 for regions of constant 
intensity and approaches 1 for regions with large 
excursions in the values of its intensity levels. 
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r = 1 - 


(9) 

• Kurtosis: it is the measure of how outlier-prone a 
distribution is. 


ZE ~ m Yp( z i) 


(10) 


Entropy: a measure of randomness. 


e =-Y J p{ z M o %ip( z i) (u) 


With: 


1 the number of possible intensity levels; 

Zi a random value of pixel intensity; 
p(zi) the histogram of intensity levels in a 
region around Zi; 

and m the average intensity defined by: 


m 


Z z < p ( z <) (I 2 ) 


Network parameters: Activation function is the sigmoid 
function given by the formula (13). 

v(Y)=—-— ( 13 ) 

\+e r 

Sigmoid function is a smooth switch function as shown 
by the curve of Figure 4. It has the property described by 
expression (14). 


oir)- 


1 asy^>+ oo 
0 asy^-cc 


( 14 ) 



Figure 4. Sigmoid function. 

Functioning of the proposed Technique: the designed 
classifier works like an automaton that traverses black 
skin lesion images and is able to differentiate healthy 
regions from lesions regions. So, when an image is input, 
the first step consists in dividing it into blocks of pixels 
of size 3x3. Then, features are calculated for each block 
and the equivalent function of the classifier is used to 
calculate the output. When the decision of the classifier is 
close to 1, all pixels values are replaced by 1. If, it’s the 
contrary, values are replaced by 0. Thus, at the end we 
obtain a binary rebuilt image (cf. flowchart of figure 5). 
The corresponding pseudocode of all this process is: 
VARIABLES Inputlmage, F, Features [9] , NbOfBlocks, 
Blocks [], Outputlmage. 

PREDEFINED FUNCTIONS CalcFeatures, ComputeNN, 
ExtractB locks, 

BEGIN 

READ Inputlmage 

DO [NbOfBlocks, Blocks] = ExtractBlocks(Inputlmage) 


count = 0 

WHILE count < NbOfBlocks 
count = count +1 

Features = CalcFeatures(Blocks(count)) 
F=C omputeNN (F etaures) 

If (0.5<F<=1) THEN 

Blocks(count)=l 

ELSE 

Blocks(count)=0 

ENDIF 
ENDWHILE 
WRITE Outputlmage 
END 


c 


START 



READ 
COLOR 
IMAGE FILE 


7 , 

EXTRACTION OF 
BLOCKS OF PIXELS : 
NbOfBlocks determined 

/ > 






COUNT =0 




COMPUTE OUTPUT 
VALUE F WITH 
TRAINED MLP-ANN 


CALCULATE FEATURES 
VECTOR FOR CURRENT 
BLOCK OF PIXELS 



Figure 5. Flowchart of the proposed recognition technique 

Evaluation metrics: We used six metrics to evaluate the 
performance of our classifier: Sensitivity, Specificity, 
Accuracy, False Positive Rate, Precision or Positive 
Predictive Value (PPV), Negative Predictive Value (NPV) 
and F 1-Score. To define these metrics, let: TP (true 
positive) stand for the number of regions correctly 
classified as lesion region, TN (true negative) be the 
number of regions correctly classified as healthy regions, 
FP (false positive) the number of regions wrongfully 
classified as lesion and FN (false negative) be the number 
of regions classified as healthy but which are in reality 
lesion regions. 


• Sensitivity or True Positive rate (TPR): 

TP 


TPR - 


TP + FN 


-xlOO 


False Positive Rate (FPR): 

FP 

FPR = xlOO 

FP + TN 

Specificity (SPC) or True Negative Rate: 


( 15 ) 


( 16 ) 










Graphics, Vision and Image Processing Journal, ISSN 1687-398X, Volume 16, Issue 3, ICGST LLC, Delaware, USA, Dec. 2016 


TN 


TPR = - 

FP + TN 

Accuracy (ACC): 

ACC = 


-xlOO 


TP+TN 


TP+FN + FP+TN 


(17) 


xlOO (18) 


Precision or Positive Predictive Value (PPV): 

TP 


PPV = 


FP+TP 


-xlOO 


(19) 


Training ROC 



Validation ROC 


False Positive Rate 


0 0.2 0.4 0.6 0 .( 

False Positive Rate 


• Negative Predictive Value (NPV): 

TN 

NPV = x 1 00 (20) 

FN + TN 

4 . Results and discussion 

Results: We got best performances with six neurons in 
hidden layer. With this configuration, the network 
achieved his best validation performance after 457 
iterations and the cross-entropy value of the objective 
function is around 0.083 (see Figure 6). 




False Positive Rate 


False Positive Rate 


Best Validation Performance is 0.083343 at epoch 457 



Figure 6. Training performance. 


Training Confusion Matrix 


Validation Confusion Matrix 



Target Class 


Target Class 


Test Confusion Matrix 


72747 

46.7% 


96.6% 

3.4% 


1811 

1.2% 


Target Class 



Target Class 


Figure 7. Confusion matrix. 


Figure 8. ROC Curves of the classifier. 

The classification performance is good, considering error 
rates shown by confusion matrix of figure 7. Indeed, 
confusion matrix help to know percentages of samples 
which are misclassified in each set. According to the 
confusion matrix, the False Match Rate (FMR) in both 
training, validation and test stages is 2.8%. Moreover, the 
performance of the obtained classifier is more illustrated 
by the ROC curves of Figure 8. The form of ROC curve 
shows us that we have an excellent classifier if we refer 
to [32]. Indeed, the greater the area under the curve 
(AUROCC) is, the better the classifier is [33]. Finally, 
the values of evaluation metrics are reported in Table II. 


Table II. Values of Evaluation Metrics 


Metrics 

Values 

True Positive Rate (TPR) or Sensitivity 

96,9% 

True Negative Rate or Specificity (SPC) 

97,5% 

False Positive Rate (FPR) 

2,5% 

Precision or Predictive Positive Value (PPV) 

97,7% 

Negative Predictive Value (NPV) 

96,6% 

Accuracy (ACC) 

97,2% 


Discussion: In our modelling, Is stand for normal or 
healthy skin region and 0s for lesion regions. Thus, we 
can notice that 3.1% of normal skin regions are 
misclassified; whereas only 2.5% of lesion regions are 
misclassified. These values are satisfactory since low 
error rate in classification of lesion regions is particularly 
important for future segmentation purpose. 

The sensitivity of the classifier is: 96,9 %; its Specificity 
is: 97,5% and the False Positive Rate (FPR) is: 2,5%. 
These values are significant, though poor to validate 
machine learning experiments [32]. So, to strengthen our 
results, the following values are considered: Precision or 
PPV: 97,7%; NPV: 96,6%; and Accuracy: 97,2%. Their 
high values (more than 95%) indicate a certain reliability. 
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However, our results need to be compared with results 
obtained by similar work, although there is no specific 
previous work with black skin in the literature, apart from 
ours published in [8]. For this purpose, two works have 
caught our attention: those of (Madan et al., 2011) and 
(Jafari et al., 2016). The first worked on the detection of 
acne-like regions in macroscopic medical images of face 
[13]. The second recently used deep learning to extract 
skin lesions from non-dermoscopic (macroscopic) images 
[22]. A comparison with our results is made in Table III 
on obtained values for three metrics used by both authors: 
best results are bold. 


Table III. Comparison of Detection performances in 
Macroscopic Skin Images 



Papers or Works 

Metrics 

Madan et 

Jafari et 

Our Work 

al., 2011 

al., 2016 

Sensitivity 

90,4 % 

95,2 % 

96,9 % 

Specificity 

88% 

98,8 % 

97,5 % 

Accuracy 

89,2 % 

98,5 % 

97,2 % 


Best results of specificity and accuracy are obtained by 
Jafari et al., whereas best sensitivity value is obtained by 
us. This shows that our recognition performances need to 
be improved, even if the gap is not very big. However, it 
is important to note that both works compared to ours, 
were performed on white skin. This could justify the best 
performances. Indeed, contrasts between healthy regions 
and lesions are much sharper in this kind of skin. 

5. Conclusion 

The major contribution of this article arise from the use 
of ANN for black skin lesions. Indeed, using MLP-ANN 
for recognition tasks is not new but finding the good 
learning features to achieve good performances is the 
main contribution of this research work. The trained 
classifier identifies lesion-like regions in black skin 
medical images. To do this, we first set up a dataset of 
size around 800.000 examples. Texture and color features 
have been calculated for each of these examples. After 
that, we trained the network, while varying the number of 
neurons in hidden layer in order to get best performances. 
Finally we obtained a very good classifier which is able 
to differentiate lesion regions from safe skin regions in 
medical images with a small error rate. 

This study also allowed us to highlight the differences 
between black skin and white skin. These differences 
exist firstly in terms of color, secondly in anatomical and 
structural terms. They induce difficulties to dermatology 
practitioners, as seen in publications made on the subject. 
Texture and color information used here as learning 
parameters, confirms practices of black skin specialists, 
who usually resort to skin palpation (searching texture 
changes) and observation of color changes to make their 
diagnostics. 

But, although the performances of the obtained classifier 
are good, error rate is not less considerable. This is why 
the developed technique is used in "artifact detection" 
stage. Thus, further processing have to be made on 
resulting images before segmentation. 
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